A Conceptual Analysis of Standard Setting in Large-Scale Assessments
Author
Wim J. van der Linden
Abstract
Elements of arbitrariness in the standard setting process are explored, and an alternative to the use of cut scores is presented. The first part of the paper analyzes the use of cut scores in large-scale assessments, discussing three different functions: (1) cut scores define the qualifications used in assessments; (2) they simplify the reporting of achievement distributions; and (3) they allow for the setting of targets for such distributions. The second part of the paper gives a decision-theoretic alternative to the use of cut scores and shows how each of the three functions identified in the first part can be approached in a way that may reduce some of the arbitrary nature of standard setting processes. The third part of the paper formulates criteria for standard setting methods that can be used to evaluate their results. (Contains six figures and eight references.) (Author/SLD)
Research Report 94-3
Similar resources
Standard setting in medical education: fundamental concepts and emerging challenges
The process of determining the minimum pass level to separate the competent students from those who do not perform well enough is called standard setting. A large number of methods are widely used to set cut-scores for both written and clinical examinations. There are some challenging issues pertaining to any standard setting procedure. Ignoring these concerns would result in a large dispute ...
Validity and Reliability Assessments of a 16-item Food Frequency Questionnaire as a Probiotic and Prebiotic Consumption Scale in People Aged 20 to 40 Years in Tehran
Background and Objectives: Given the health effects of probiotics and prebiotics in the prevention and control of diseases and the lack of standard questionnaires in this field in Iran, the objective of the present study was to assess the validity and reliability of a questionnaire designed to assess the consumption of probiotics and prebiotics in individuals aged 20–40 years in Tehran, Iran. Mate...
I-4: External Quality Assessment - A Necessity in The Andrology Laboratory
Andrology laboratories need to produce reliable results for appropriate diagnostic and health care decisions. Since semen analysis is highly complex and procedurally difficult to standardize, quality control (QC) is essential to detect and correct systematic errors and high variability of results. The large discrepancies between assessments of sperm concentration and morphology in different lab...
Political Consensus Through Setting International Accounting Standards Case of IAS 22
The study aims to reveal that a valid and effective standard, once issued, may reflect the needs and expectations of a few powerful interested bodies, owing to neutral comments and a dearth of lobbyists in the process of setting the standard. The research investigates the workings of the IASB by exploring the standard setting process, specifically in relation to the standard on busines...
Interpreting the Validity of a High-Stakes Test in Light of the Argument-Based Framework: Implications for Test Improvement
The validity of large-scale assessments may be compromised, partly due to their content inappropriateness or construct underrepresentation. Few validity studies have focused on such assessments within an argument-based framework. This study analyzed the domain description and evaluation inference of the Ph.D. Entrance Exam of ELT (PEEE) sat by Ph.D. examinees (n = 999) in 2014 in Iran....